|
A restricted Boltzmann machine (RBM) is a generative stochastic artificial neural network that can learn a probability distribution over its set of inputs. RBMs were initially invented under the name Harmonium by Paul Smolensky in 1986, but only rose to prominence after Geoffrey Hinton and collaborators invented fast learning algorithms for them in the mid-2000s. RBMs have found applications in dimensionality reduction, classification, collaborative filtering, feature learning and topic modelling.〔Ruslan Salakhutdinov and Geoffrey Hinton (2010). (Replicated softmax: an undirected topic model ). ''Neural Information Processing Systems'' 23.〕 They can be trained in either supervised or unsupervised ways, depending on the task. As their name implies, RBMs are a variant of Boltzmann machines, with the restriction that their neurons must form a bipartite graph: a pair of nodes from each of the two groups of units, commonly referred to as the "visible" and "hidden" units respectively, may have a symmetric connection between them, and there are no connections between nodes within a group. By contrast, "unrestricted" Boltzmann machines may have connections between hidden units. This restriction allows for more efficient training algorithms than are available for the general class of Boltzmann machines, in particular the gradient-based contrastive divergence algorithm.〔Miguel Á. Carreira-Perpiñán and Geoffrey Hinton (2005). On contrastive divergence learning. ''Artificial Intelligence and Statistics''.〕 Restricted Boltzmann machines can also be used in deep learning networks. In particular, deep belief networks can be formed by "stacking" RBMs and optionally fine-tuning the resulting deep network with gradient descent and backpropagation. ==Structure== The standard type of RBM has binary-valued (Boolean/Bernoulli) hidden and visible units, and consists of a matrix of weights (size ''m''×''n'') associated with the connection between hidden unit and visible unit , as well as bias weights (offsets) for the visible units and for the hidden units. Given these, the ''energy'' of a configuration (pair of boolean vectors) is defined as : or, in matrix notation, : where is a partition function defined as the sum of over all possible configurations (in other words, just a normalizing constant to ensure the probability distribution sums to 1). Similarly, the (marginal) probability of a visible (input) vector of booleans is the sum over all possible hidden layer configurations:〔 : Since the RBM has the shape of a bipartite graph, with no intra-layer connections, the hidden unit activations are mutually independent given the visible unit activations and conversely, the visible unit activations are mutually independent given the hidden unit activations.〔 That is, for visible units and hidden units, the conditional probability of a configuration of the visible units , given a configuration of the hidden units , is :. Conversely, the conditional probability of given is :. The individual activation probabilities are given by : and where denotes the logistic sigmoid. The visible units of RBM can be multinomial, although the hidden units are Bernoulli. In this case, the logistic function for visible units is replaced by the Softmax function : where ''K'' is the number of discrete values that the visible values have. They are applied in topic modeling,〔 and recommender systems.〔 抄文引用元・出典: フリー百科事典『 ウィキペディア(Wikipedia)』 ■ウィキペディアで「Restricted Boltzmann machine」の詳細全文を読む スポンサード リンク
|